10 research outputs found
NetShaper: A Differentially Private Network Side-Channel Mitigation System
The widespread adoption of encryption in network protocols has significantly
improved the overall security of many Internet applications. However, these
protocols cannot prevent network side-channel leaks -- leaks of sensitive
information through the sizes and timing of network packets. We present
NetShaper, a system that mitigates such leaks based on the principle of traffic
shaping. NetShaper's traffic shaping provides differential privacy guarantees
while adapting to the prevailing workload and congestion conditions, and allows
configuring a tradeoff between privacy guarantees, bandwidth and latency
overheads. Furthermore, NetShaper provides a modular and portable tunnel
endpoint design that can support diverse applications. We present a
middlebox-based implementation of NetShaper and demonstrate its applicability
in a video streaming and a web service application.
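The core traffic-shaping idea can be sketched as follows. This is an illustrative simplification, not NetShaper's actual mechanism: each shaping interval, the tunnel transmits a number of bytes equal to the queued byte count plus Laplace noise calibrated to a chosen sensitivity and epsilon, clamped to a valid range, with padding or deferral making up the difference. The function name and parameters are hypothetical.

```python
import random

def shaped_payload_size(queue_bytes, sensitivity, epsilon, max_burst):
    """One shaping decision (hedged sketch): return a DP-noised
    transmission size so the observed traffic reveals little about
    the true application queue length."""
    scale = sensitivity / epsilon
    # The difference of two Exp(1) draws is a standard Laplace sample.
    noise = scale * (random.expovariate(1.0) - random.expovariate(1.0))
    # Clamp to a valid size: never negative, never above the burst cap.
    return max(0, min(max_burst, round(queue_bytes + noise)))
```

A smaller epsilon widens the noise and raises the expected padding, which is one concrete face of the privacy/bandwidth tradeoff the abstract mentions.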
Packing Privacy Budget Efficiently
Machine learning (ML) models can leak information about users, and
differential privacy (DP) provides a rigorous way to bound that leakage under a
given budget. This DP budget can be regarded as a new type of compute resource
in workloads of multiple ML models training on user data. Once it is used, the
DP budget is forever consumed. Therefore, it is crucial to allocate it most
efficiently to train as many models as possible. This paper presents a
DP-budget scheduler that optimizes for efficiency. We formulate privacy
scheduling as a new type of multidimensional knapsack problem, called privacy
knapsack, which maximizes DP budget efficiency. We show that privacy knapsack
is NP-hard, hence practical algorithms are necessarily approximate. We develop
an approximation algorithm for privacy knapsack, DPK, and evaluate it on
microbenchmarks and on a new, synthetic private-ML workload we developed from
the Alibaba ML cluster trace. We show that DPK: (1) often approaches the
efficiency-optimal schedule, (2) consistently schedules more tasks compared to
a state-of-the-art privacy scheduling algorithm that focuses on fairness
(1.3-1.7x in Alibaba, 1.0-2.6x in microbenchmarks), but (3) sacrifices some
level of fairness for efficiency. Therefore, using DPK, DP ML operators should
be able to train more models on the same amount of user data while offering the
same privacy guarantee to their users.
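The privacy-knapsack setting can be made concrete with a toy greedy heuristic. This is not the DPK algorithm from the paper, only an illustrative approximation: each task demands some epsilon from each data block it touches, and we admit tasks in decreasing profit-per-budget order while every block still has budget to spare.

```python
def greedy_privacy_knapsack(tasks, capacity):
    """Illustrative greedy approximation for privacy knapsack
    (hypothetical helper, not the paper's DPK algorithm).

    tasks: list of (profit, {block_id: epsilon_demand}) tuples
    capacity: {block_id: available_epsilon}
    """
    remaining = dict(capacity)
    # Rank tasks by profit per unit of total epsilon demanded.
    order = sorted(tasks, key=lambda t: t[0] / sum(t[1].values()), reverse=True)
    scheduled, total_profit = [], 0.0
    for profit, demand in order:
        if all(remaining.get(b, 0.0) >= eps for b, eps in demand.items()):
            for b, eps in demand.items():
                remaining[b] -= eps  # budget, once spent, is gone forever
            scheduled.append((profit, demand))
            total_profit += profit
    return scheduled, total_profit
```

Because the budget is multidimensional (one dimension per data block) and non-replenishable, such greedy rules can be far from optimal, which is consistent with the problem being NP-hard.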
Web Transparency for Complex Targeting: Algorithms, Limits, and Tradeoffs
Big Data promises important societal progress but exacerbates the need for due process and accountability. Companies and institutions can now discriminate between users at an individual level using collected data or past behavior. Worse, today they can do so in near-perfect opacity. The nascent field of web transparency aims to develop the tools and methods necessary to reveal how information is used; however, it still lacks robust tools that let users and investigators identify targeting based on multiple inputs. Here, we formalize for the first time the problem of detecting and identifying targeting on combinations of inputs, and we provide the first algorithm that is asymptotically exact. This algorithm is designed to serve as a theoretical foundational block for building future scalable and robust web transparency tools. It offers three key properties. First, our algorithm is service-agnostic and applies to a variety of settings under a broad set of assumptions. Second, our algorithm's analysis delineates a theoretical detection limit that characterizes which forms of targeting can be distinguished from noise and which cannot. Third, our algorithm establishes fundamental tradeoffs that lead the way to new metrics for the science of web transparency. Understanding the tradeoff between effective targeting and targeting concealment lets us determine under which conditions predatory targeting can be made unprofitable by transparency tools.
Vers une plus grande transparence du Web (Toward Greater Transparency of the Web)
More and more, the Web giants (Amazon, Google, and Twitter foremost among them) tap into the riches of "Big Data": they collect myriad data that they exploit for their personalized recommendation algorithms and their advertising campaigns. Such methods can considerably improve the services rendered to their users, but their opacity is a matter of debate. Indeed, no sufficiently robust tool exists today that can trace, across the Web, how online services use a user's data and information. Motivated by this lack of transparency, we developed a prototype named XRay, which can predict which piece of data, among all those present in a user account, is responsible for the receipt of an advertisement. In this article, we present its principle as well as the results of our first experiments. At the same time, we introduce the very first theoretical model of the web transparency problem, and we interpret XRay's performance in light of the results obtained in this model. In particular, we show that Θ(log N) auxiliary user accounts, populated by a random process, suffice to determine which of the N pieces of data present caused the receipt of an advertisement. We briefly discuss possible extensions and some open problems.
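The Θ(log N) auxiliary-account idea can be illustrated with a toy correlation-based inference, in the spirit of XRay but not its actual implementation; the function names and the `ad_shown` oracle are hypothetical. Each auxiliary account is populated with a random half of the N inputs, and the input whose presence agrees most often with the ad's appearance is blamed.

```python
import random

def infer_targeted_input(inputs, num_accounts, ad_shown):
    """XRay-style inference sketch (hypothetical API): blame the input
    most correlated with the ad's appearance across random accounts.

    ad_shown(account_inputs) -> bool simulates observing the ad
    in an account populated with that subset of inputs.
    """
    rng = random.Random(0)  # fixed seed for reproducibility
    accounts = [set(x for x in inputs if rng.random() < 0.5)
                for _ in range(num_accounts)]
    scores = {x: 0 for x in inputs}
    for acct in accounts:
        saw_ad = ad_shown(acct)
        for x in inputs:
            # Reward inputs whose presence agrees with the observation.
            if (x in acct) == saw_ad:
                scores[x] += 1
    return max(scores, key=scores.get)
```

The true cause agrees with every observation, while each decoy agrees only about half the time, so a logarithmic number of accounts suffices to separate it from the noise with high probability.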
Boost: Effective Caching in Differentially-Private Databases
Differentially private (DP) databases can enable privacy-preserving analytics
over datasets or data streams containing sensitive personal records. In such
systems, user privacy is a very limited resource that is consumed by every new
query, and hence must be aggressively conserved. We propose Boost, the most
effective caching component for linear query workloads over DP databases. Boost
builds upon private multiplicative weights (PMW), a DP mechanism that is
powerful in theory but very ineffective in practice, and transforms it into a
highly effective caching object, PMW-Bypass, which uses prior-query results
obtained through an external DP mechanism to train a PMW to answer arbitrary
future linear queries accurately and "for free" from a privacy perspective. We
show that Boost with PMW-Bypass conserves significantly more budget compared to
vanilla PMW and simpler cache designs: a 1.51-14.25x improvement in
experiments on public Covid19 and CitiBike datasets. Moreover, Boost
incorporates support for range-query workloads, such as timeseries or streaming
workloads, where opportunities exist to further conserve privacy budget through
DP parallel composition and warm-starting of PMW state. Our work thus
establishes both a coherent system design and the theoretical underpinnings for
effective caching in DP databases.
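A single private-multiplicative-weights step, the primitive Boost builds on, can be sketched as follows. This is a simplified textbook-style PMW update, not Boost's PMW-Bypass logic; the function name and parameters are illustrative. After an answer has been paid for through an external DP mechanism, the public histogram is reweighted toward agreeing with that answer, so future similar queries can be served from the histogram at no additional privacy cost.

```python
import math

def pmw_update(hist, query, true_answer, learning_rate=0.1):
    """One multiplicative-weights step on a public histogram
    (hedged sketch of the PMW primitive, not Boost's PMW-Bypass).

    hist:  dict mapping domain element -> probability mass
    query: dict mapping domain element -> coefficient in [0, 1]
           (a linear query evaluated as sum_x hist[x] * query[x])
    """
    est = sum(hist[x] * query.get(x, 0.0) for x in hist)
    # Push mass toward the elements the query covers if we
    # underestimated, away from them if we overestimated.
    sign = 1.0 if true_answer > est else -1.0
    new = {x: hist[x] * math.exp(sign * learning_rate * query.get(x, 0.0))
           for x in hist}
    z = sum(new.values())
    return {x: v / z for x, v in new.items()}
```

Once the histogram's estimates are accurate for a query class, answers read off it consume no further budget, which is the sense in which a trained PMW answers "for free".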